2 PATENT - AMENDMENT AFTER FINAL 

Response Under 37 C.F.R. 1.116 - Expedited 
Procedure - Examining Group 2615 

Amendments to the Claims 

Please amend claims 1, 5, 9 and 1 1. Please cancel claims 20-27. The currently pending 
claims after amendment are listed below. 



1 1 . (Currently Amended) A digital camera, comprising: 

2 a housing; 

3 a digital optical sensing apparatus mounted within said housing, said digital optical sensing 

4 apparatus sensing optical images; 

5 a storage medium for storing digital optical images captured by said digital optical sensing 

6 apparatus; 

7 an acoustic sensor capable of sensing human speech; 

8 a speech reduction apparatus coupled to said acoustic sensor, said speech reduction 

9 apparatus converting human speech sensed by said acoustic sensor to a symbolic text form; and 

10 a controller which stores said symbolic text form in said storage medium in a relationship 

1 1 associated with a captured digital image, wherein said controller: 

12 (a) receives a user indication of a plurality of discrete time intervals; 

13 (b) records a plurality of discrete human speech segments sensed by said acoustic 

14 sensor in respective said discrete time intervals; 

1 5 (c) causes said speech reduction apparatus to convert each said human speech 

16 segment to a corresponding symbolic text segment; and 

17 (d) automatically associates a respective digital optical image captured by said digital 

1 8 optical sensing apparatus with each said symbolic text segment based on a temporal 

19 relationship between the time interval in which the discrete human speech segment 

20 corresponding to the symbolic text segment was recorded and the capturing of said digital 

2 1 optical image; 
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22 wherein said controller associates a respective digital image with each symbolic text 

23 segment according to all of the following association priorities: 

24 (1) if a first digital image is captured during the recording of a human speech segment 

25 corresponding to the symbolic text segment, the symbolic text segment is associated with 

26 the first digital image; 

27 (2) if no digital image is captured from a time the digital camera is powered on until 

28 the end of the recoding of the human speech segment corresponding to the symbolic text 

29 segment, and a second digital image is captured after recording the human speech segment 
• 30 but before the digital camera is powered off then the symbolic text segment is associated 

31 with the second digital image; and 

32 (3) in all other cases, the symbolic text is associated with the digital image most 

33 recently captured before the recording of the human speech segment corresponding to the 

34 symbolic text segment . 

1 2. (Original) The digital camera of claim 1 ? wherein said controller comprises a 

2 programmable processor executing a control program for controlling the operation of said digital 

3 camera. 

1 3. (Original) The digital camera of claim 2 5 wherein said speech reduction apparatus 

2 comprises a speech reduction algorithm embodied as a plurality of instructions executable on said 

3 programmable processor. 

1 4. (Original) The digital camera of claim 1 , wherein said speech reduction apparatus 

2 converts said human speech sensed by said acoustic sensor to an intermediate symbolic form 

3 comprising a symbolic representation of phonemes, said intermediate symbolic form being 

4 subsequently reduced to natural language text by a separate apparatus. 
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1 5. (Currently Amended) A method of operating a digital camera, comprising the steps of: 

2 capturing a plurality of digital images of respective objects of interest with optical sensing 

3 apparatus of said digital camera; 

4 recording a plurality of discrete segments of human speech of a user in said digital camera 

5 during a plurality of respective discrete time intervals, each respective discrete time interval 

6 occurring substantially contemporaneously with capturing of each respective digital image of said 

7 plurality of digital images; 

8 rendering each said segment of said plurality of discrete segments of human speech in a 

9 respective corresponding segment of symbolic text using speech reduction apparatus within said 

10 digital camera; and 

1 1 automatically associating each respective digital image of said plurality of digital images 

12 with a respective corresponding segment of symbolic text rendered from a respective 

13 corresponding segment of human speech based on a temporal relationship between the respective 

14 discrete time interval during which the corresponding segment of human speed was recorded and 

1 5 the capturing of the respective digital image, and storing each said symbolic text segment in a 

16 relationship associated with each respective said captured digital image; 

17 wherein said step of automatically associating each respective digital image of said 

18 plurality of digital images with a respective corresponding segment of symbolic text comprises 

19 automatically associating according to all of the following association priorities: 

20 (I) if a first digital image is captured during the recording of a human speech segment 

21 corresponding to the symbolic text segment, the symbolic text segment is associated with the first 

22 digital image; 

23 (2) if no digital image is captured from a time the digital camera is powered on until the 

24 end of the recoding of the human speech segment corresponding to the symbolic text segment, 

25 and a second digital image is captured after recording the human speech segment but before the 
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26 digital camera is powered off, then the symbolic text segment is associated with the second digital 

27 image; and 

28 (3) in all other cases, the symbolic text is associated with the digital image most recently 

29 captured before the recording of the human speech segment corresponding to the symbolic text 

30 segment . 

1 6. (Previously Presented) The method of operating a digital camera of claim 5, wherein said 

2 step of rendering each said segment of said plurality of discrete segments of human speech in a 

3 respective corresponding segment of symbolic text converts said human speech to an intermediate 

4 symbolic form comprising a symbolic representation of phonemes, said intermediate symbolic 

5 form being subsequently reduced to natural language text by an apparatus separate from said 

6 digital camera. 

1 7. (Previously Presented) The method of operating a digital camera of claim 5, wherein said 

2 step of rendering each said segment of said plurality of discrete segments of human speech in a 

3 respective corresponding segment of symbolic text is performed by a programmable processor 

4 executing a speech reduction program. 

1 8. (Previously Presented) The method of operating a digital camera of claim 7 5 wherein said 

2 programmable processor further executes a control program for controlling the operation of said 

3 digital camera, and said step of rendering each said segment of said plurality of discrete segments 

4 of human speech in a respective corresponding segment of symbolic text is performed by said 

5 programmable processor in the background when said control program is otherwise unoccupied. 
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1 9. (Currently Amended) A program product for controlling the operation of a digital camera, 

2 said program product comprising a plurality of processor executable instructions recorded on 

3 signal-bearing media, wherein said instructions, when executed by at least one programmable 

4 processor within said digital camera, cause the camera to perform the steps of: 

5 capturing a plurality of digital images of respective objects of interest with optical sensing 

6 apparatus of said digital camera; 

7 recording a plurality of discrete segments of human speech of a user in said digital camera 

8 during a plurality of respective discrete time intervals, each respective discrete time interval 

9 occurring substantially contemporaneously with capturing of each respective digital image of said 

1 0 plurality of digital images; 

1 1 rendering each said segment of said plurality of discrete segments of human speech in a 

12 respective corresponding segment of symbolic text using speech reduction apparatus within said 

1 3 digital camera; and 

14 associating each respective digital image of said plurality of digital images with a 

1 5 respective corresponding segment of symbolic text rendered from a respective corresponding 

16 segment of human speech based on a temporal relationship between the respective discrete time 

17 interval during which the corresponding segment of human speech was recorded and the capturing 

1 8 of the respective digital image, and storing each said symbolic text segment in a relationship 

19 associated with each respective said captured digital image; 

20 wherein said step of associating each respective digital image of said plurality of digital 

21 images with a respective corresponding segment of symbolic text comprises associating according 

22 to all of the following priorities: 

23 (I) if a first digital image is captured during the recording of a human speech segment 

24 corresponding to the symbolic text segment, the symbolic text segment is associated with 

25 the first digital image; 

26 (2) if no digital image is captured from a time the digital camera is powered on until 
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27 the end of the recoding of the human speech segment corresponding to the symbolic text 

28 segment, and a second digital image is captured after recording the human speech segment 

29 but before the digital camera is powered off, then the symbolic text segment is associated 

30 with the second digital image; and 

31 (3) in all other cases, the symbolic text is associated with the digital image most 

32 recently captured before the recording of the human speech segment corresponding to the 

33 symbolic text segment . 

1 1 0. (Previously Presented) The program product for controlling the operation of a digital 

2 camera of claim 9, wherein said step of rendering each said segment of said plurality of discrete 

3 segments of human speech in a respective corresponding segment of symbolic text converts said 

4 human speech to an intermediate symbolic form comprising a symbolic representation of 

5 phonemes, said intermediate symbolic form being subsequently reduced to natural language text 

6 by an apparatus separate from said digital camera. 

1 11. (Currently Amended) A method of recording information with digital images, comprising 

2 the steps of: 

3 capturing a plurality of digital images of respective objects of interest with optical sensing 

4 apparatus of a digital camera; 

5 recording a plurality of discrete segments of human speech of a user in said digital camera 

6 during a plurality of respective discrete time intervals occurring substantially contemporaneously 

7 with capturing of each respective digital image of said plurality of digital images; 

8 rendering each said segment of said plurality of discrete segments of human speech into a 

9 respective corresponding segment of symbolic text using speech reduction apparatus within said 
10 digital camera; 
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1 1 automatically associating each respective digital image of said plurality of digital images 

12 with a respective corresponding segment of symbolic text rendered from a respective 

13 corresponding segment of human speech based on a temporal relationship between the respective 

14 discrete time interval during which the corresponding segment of human speech was recorded and 

1 5 the capturing of the respective digital image, and recording said association in a memory of said 

16 digital camera; 

17 uploading said at least one digital image and said at least one segment of symbolic text to a 

18 digital image formatting apparatus; and 

19 formatting said plurality of digital images and said plurality of segments of symbolic text 

20 for viewing by a user using said digital image formatting apparatus, wherein each said segment of 

21 symbolic text is formatted for viewing in a human readable form associated with its 

22 corresponding digital image; 

23 wherein said step of automatically associating each respective digital image of said 

24 plurality of digital images with a respective corresponding segment of symbolic text comprises 

25 automatically associating according to at least one of the following priorities: 

26 (I) if a first digital image is captured during the recording of a human speech segment 

27 corresponding to the symbolic text segment, the symbolic text segment is associated with the first 

28 digital image; 

29 (2) if no digital image is captured from a time the digital camera is powered on until the 

30 end of the recoding of the human speech segment corresponding to the symbolic text segment, 

31 and a second digital image is captured after recording the human speech segment but before the 

32 digital camera is powered off, then the symbolic text segment is associated with the second digital 

33 image; and 

34 (3) in all other cases, the symbolic text is associated with the digital image most recently 

35 captured before the recording of the human speech segment corresponding to the symbolic text 

36 segment . 
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37 12. (Previously Presented) The method of recording information with digital images of claim 

38 11, wherein said step of rendering each said segment of said plurality of discrete segments of 

39 human speech in a respective corresponding segment of symbolic text converts said human 

40 speech to an intermediate symbolic form comprising a symbolic representation of phonemes, and 

41 wherein said step of formatting said plurality of digital images and said plurality of segments of 

42 symbolic text for viewing comprises reducing said intermediate symbolic form to natural 

43 language text. 

1 13. (Original) The method of recording information with digital images of claim 1 1 , wherein 

2 said digital image formatting apparatus is a general-purpose digital computer executing a digital 

3 image formatting program. 

1 14. (Previously Presented) The method of recording information with digital images of claim 

2 11, wherein said step of formatting said plurality of digital images and said plurality of segments 

3 of symbolic text comprises formatting for output on paper, wherein formatted text is printed on 

4 paper with a corresponding digital image. 

1 15. (Previously Presented) The method of recording information with digital images of claim 

2 11, wherein said step of formatting said plurality of digital' images and said plurality of segments 

3 of symbolic text comprises formatting for viewing from an output screen of a digital device, 

4 wherein formatted text is displayed on said output screen with a corresponding digital image. 

16-27. (Cancelled) 
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